53 research outputs found
Mining time-series data using discriminative subsequences
Time-series data is abundant, and must be analysed to extract usable knowledge. Local-shape-based methods offer improved performance for many problems, and a
comprehensible method of understanding both data and models.
For time-series classification, we transform the data into a local-shape space using a shapelet transform. A shapelet is a time-series subsequence that is discriminative
of the class of the original series. We use a heterogeneous ensemble classifier on the transformed data. The accuracy of our method is significantly better than the time-series classification benchmark (1-nearest-neighbour with dynamic time-warping distance), and significantly better than the previous best shapelet-based classifiers.
We use two methods to increase interpretability: First, we cluster the shapelets using a novel, parameterless clustering method based on Minimum Description Length,
reducing dimensionality and removing duplicate shapelets. Second, we transform the shapelet data into binary data reflecting the presence or absence of particular
shapelets, a representation that is straightforward to interpret and understand.
We supplement the ensemble classifier with partial classifocation. We generate rule sets on the binary-shapelet data, improving performance on certain classes, and revealing the relationship between the shapelets and the class label. To aid interpretability, we use a novel algorithm, BruteSuppression, that can substantially reduce
the size of a rule set without negatively affecting performance, leading to a more compact, comprehensible model.
Finally, we propose three novel algorithms for unsupervised mining of approximately repeated patterns in time-series data, testing their performance in terms of
speed and accuracy on synthetic data, and on a real-world electricity-consumption device-disambiguation problem. We show that individual devices can be found automatically
and in an unsupervised manner using a local-shape-based approach
HER2-enriched subtype and novel molecular subgroups drive aromatase inhibitor resistance and an increased risk of relapse in early ER+/HER2+ breast cancer
BACKGROUND: Oestrogen receptor positive/ human epidermal growth factor receptor positive (ER+/HER2+) breast cancers (BCs) are less responsive to endocrine therapy than ER+/HER2- tumours. Mechanisms underpinning the differential behaviour of ER+HER2+ tumours are poorly characterised. Our aim was to identify biomarkers of response to 2 weeks’ presurgical AI treatment in ER+/HER2+ BCs. METHODS: All available ER+/HER2+ BC baseline tumours (n=342) in the POETIC trial were gene expression profiled using BC360™ (NanoString) covering intrinsic subtypes and 46 key biological signatures. Early response to AI was assessed by changes in Ki67 expression and residual Ki67 at 2 weeks (Ki672wk). Time-To-Recurrence (TTR) was estimated using Kaplan-Meier methods and Cox models adjusted for standard clinicopathological variables. New molecular subgroups (MS) were identified using consensus clustering. FINDINGS: HER2-enriched (HER2-E) subtype BCs (44.7% of the total) showed poorer Ki67 response and higher Ki672wk (p<0.0001) than non-HER2-E BCs. High expression of ERBB2 expression, homologous recombination deficiency (HRD) and TP53 mutational score were associated with poor response and immune-related signatures with High Ki672wk. Five new MS that were associated with differential response to AI were identified. HER2-E had significantly poorer TTR compared to Luminal BCs (HR 2.55, 95% CI 1.14–5.69; p=0.0222). The new MS were independent predictors of TTR, adding significant value beyond intrinsic subtypes. INTERPRETATION: Our results show HER2-E as a standardised biomarker associated with poor response to AI and worse outcome in ER+/HER2+. HRD, TP53 mutational score and immune-tumour tolerance are predictive biomarkers for poor response to AI. Lastly, novel MS identify additional non-HER2-E tumours not responding to AI with an increased risk of relapse
Multiple novel prostate cancer susceptibility signals identified by fine-mapping of known risk loci among Europeans
Genome-wide association studies (GWAS) have identified numerous common prostate cancer (PrCa) susceptibility loci. We have
fine-mapped 64 GWAS regions known at the conclusion of the iCOGS study using large-scale genotyping and imputation in
25 723 PrCa cases and 26 274 controls of European ancestry. We detected evidence for multiple independent signals at 16
regions, 12 of which contained additional newly identified significant associations. A single signal comprising a spectrum of
correlated variation was observed at 39 regions; 35 of which are now described by a novel more significantly associated lead SNP,
while the originally reported variant remained as the lead SNP only in 4 regions. We also confirmed two association signals in
Europeans that had been previously reported only in East-Asian GWAS. Based on statistical evidence and linkage disequilibrium
(LD) structure, we have curated and narrowed down the list of the most likely candidate causal variants for each region.
Functional annotation using data from ENCODE filtered for PrCa cell lines and eQTL analysis demonstrated significant
enrichment for overlap with bio-features within this set. By incorporating the novel risk variants identified here alongside the
refined data for existing association signals, we estimate that these loci now explain ∼38.9% of the familial relative risk of PrCa,
an 8.9% improvement over the previously reported GWAS tag SNPs. This suggests that a significant fraction of the heritability of
PrCa may have been hidden during the discovery phase of GWAS, in particular due to the presence of multiple independent
signals within the same regio
Mutations in the histone methyltransferase gene KMT2B cause complex early-onset dystonia.
Histone lysine methylation, mediated by mixed-lineage leukemia (MLL) proteins, is now known to be critical in the regulation of gene expression, genomic stability, cell cycle and nuclear architecture. Despite MLL proteins being postulated as essential for normal development, little is known about the specific functions of the different MLL lysine methyltransferases. Here we report heterozygous variants in the gene KMT2B (also known as MLL4) in 27 unrelated individuals with a complex progressive childhood-onset dystonia, often associated with a typical facial appearance and characteristic brain magnetic resonance imaging findings. Over time, the majority of affected individuals developed prominent cervical, cranial and laryngeal dystonia. Marked clinical benefit, including the restoration of independent ambulation in some cases, was observed following deep brain stimulation (DBS). These findings highlight a clinically recognizable and potentially treatable form of genetic dystonia, demonstrating the crucial role of KMT2B in the physiological control of voluntary movement.Funding for the project was provided by the Wellcome Trust for UK10K (WT091310) and DDD Study. The DDD study presents independent research commissioned by the Health Innovation Challenge Fund [grant number HICF-1009-003] - see www.ddduk.org/access.html for full acknowledgement. This work was supported in part by the Intramural Research Program of the National Human Genome Research Institute and the Common Fund, NIH Office of the Director. This work was supported in part by the German Ministry of Research and Education (grant nos. 01GS08160 and 01GS08167; German Mental Retardation Network) as part of the National Genome Research Network to A.R. and D.W. and by the Deutsche Forschungsgemeinschaft (AB393/2-2) to A.R. Brain expression data was provided by the UK Human Brain Expression Consortium (UKBEC), which comprises John A. Hardy, Mina Ryten, Michael Weale, Daniah Trabzuni, Adaikalavan Ramasamy, Colin Smith and Robert Walker, affiliated with UCL Institute of Neurology (J.H., M.R., D.T.), King’s College London (M.R., M.W., A.R.) and the University of Edinburgh (C.S., R.W.)
Breast cancer risk variants at 6q25 display different phenotype associations and regulate ESR1, RMND1 and CCDC170.
We analyzed 3,872 common genetic variants across the ESR1 locus (encoding estrogen receptor α) in 118,816 subjects from three international consortia. We found evidence for at least five independent causal variants, each associated with different phenotype sets, including estrogen receptor (ER(+) or ER(-)) and human ERBB2 (HER2(+) or HER2(-)) tumor subtypes, mammographic density and tumor grade. The best candidate causal variants for ER(-) tumors lie in four separate enhancer elements, and their risk alleles reduce expression of ESR1, RMND1 and CCDC170, whereas the risk alleles of the strongest candidates for the remaining independent causal variant disrupt a silencer element and putatively increase ESR1 and RMND1 expression.This is the author accepted manuscript. The final version is available from Nature Publishing Group via http://dx.doi.org/10.1038/ng.352
Recommended from our members
Effect of Hydrocortisone on Mortality and Organ Support in Patients With Severe COVID-19: The REMAP-CAP COVID-19 Corticosteroid Domain Randomized Clinical Trial.
Importance: Evidence regarding corticosteroid use for severe coronavirus disease 2019 (COVID-19) is limited. Objective: To determine whether hydrocortisone improves outcome for patients with severe COVID-19. Design, Setting, and Participants: An ongoing adaptive platform trial testing multiple interventions within multiple therapeutic domains, for example, antiviral agents, corticosteroids, or immunoglobulin. Between March 9 and June 17, 2020, 614 adult patients with suspected or confirmed COVID-19 were enrolled and randomized within at least 1 domain following admission to an intensive care unit (ICU) for respiratory or cardiovascular organ support at 121 sites in 8 countries. Of these, 403 were randomized to open-label interventions within the corticosteroid domain. The domain was halted after results from another trial were released. Follow-up ended August 12, 2020. Interventions: The corticosteroid domain randomized participants to a fixed 7-day course of intravenous hydrocortisone (50 mg or 100 mg every 6 hours) (n = 143), a shock-dependent course (50 mg every 6 hours when shock was clinically evident) (n = 152), or no hydrocortisone (n = 108). Main Outcomes and Measures: The primary end point was organ support-free days (days alive and free of ICU-based respiratory or cardiovascular support) within 21 days, where patients who died were assigned -1 day. The primary analysis was a bayesian cumulative logistic model that included all patients enrolled with severe COVID-19, adjusting for age, sex, site, region, time, assignment to interventions within other domains, and domain and intervention eligibility. Superiority was defined as the posterior probability of an odds ratio greater than 1 (threshold for trial conclusion of superiority >99%). Results: After excluding 19 participants who withdrew consent, there were 384 patients (mean age, 60 years; 29% female) randomized to the fixed-dose (n = 137), shock-dependent (n = 146), and no (n = 101) hydrocortisone groups; 379 (99%) completed the study and were included in the analysis. The mean age for the 3 groups ranged between 59.5 and 60.4 years; most patients were male (range, 70.6%-71.5%); mean body mass index ranged between 29.7 and 30.9; and patients receiving mechanical ventilation ranged between 50.0% and 63.5%. For the fixed-dose, shock-dependent, and no hydrocortisone groups, respectively, the median organ support-free days were 0 (IQR, -1 to 15), 0 (IQR, -1 to 13), and 0 (-1 to 11) days (composed of 30%, 26%, and 33% mortality rates and 11.5, 9.5, and 6 median organ support-free days among survivors). The median adjusted odds ratio and bayesian probability of superiority were 1.43 (95% credible interval, 0.91-2.27) and 93% for fixed-dose hydrocortisone, respectively, and were 1.22 (95% credible interval, 0.76-1.94) and 80% for shock-dependent hydrocortisone compared with no hydrocortisone. Serious adverse events were reported in 4 (3%), 5 (3%), and 1 (1%) patients in the fixed-dose, shock-dependent, and no hydrocortisone groups, respectively. Conclusions and Relevance: Among patients with severe COVID-19, treatment with a 7-day fixed-dose course of hydrocortisone or shock-dependent dosing of hydrocortisone, compared with no hydrocortisone, resulted in 93% and 80% probabilities of superiority with regard to the odds of improvement in organ support-free days within 21 days. However, the trial was stopped early and no treatment strategy met prespecified criteria for statistical superiority, precluding definitive conclusions. Trial Registration: ClinicalTrials.gov Identifier: NCT02735707
Recommended from our members
Report on computational assessment of Tumor Infiltrating Lymphocytes from the International Immuno-Oncology Biomarker Working Group
Funder: U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)Funder: National Center for Research Resources under award number 1 C06 RR12463-01, VA Merit Review Award IBX004121A from the United States Department of Veterans Affairs Biomedical Laboratory Research and Development Service, the DOD Prostate Cancer Idea Development Award (W81XWH-15-1-0558), the DOD Lung Cancer Investigator-Initiated Translational Research Award (W81XWH-18-1-0440), the DOD Peer Reviewed Cancer Research Program (W81XWH-16-1-0329), the Ohio Third Frontier Technology Validation Fund, the Wallace H. Coulter Foundation Program in the Department of Biomedical Engineering and the Clinical and Translational Science Award Program (CTSA) at Case Western Reserve University.Funder: Susan G Komen Foundation (CCR CCR18547966) and a Young Investigator Grant from the Breast Cancer Alliance.Funder: The Canadian Cancer SocietyFunder: Breast Cancer Research Foundation (BCRF), Grant No. 17-194Abstract: Assessment of tumor-infiltrating lymphocytes (TILs) is increasingly recognized as an integral part of the prognostic workflow in triple-negative (TNBC) and HER2-positive breast cancer, as well as many other solid tumors. This recognition has come about thanks to standardized visual reporting guidelines, which helped to reduce inter-reader variability. Now, there are ripe opportunities to employ computational methods that extract spatio-morphologic predictive features, enabling computer-aided diagnostics. We detail the benefits of computational TILs assessment, the readiness of TILs scoring for computational assessment, and outline considerations for overcoming key barriers to clinical translation in this arena. Specifically, we discuss: 1. ensuring computational workflows closely capture visual guidelines and standards; 2. challenges and thoughts standards for assessment of algorithms including training, preanalytical, analytical, and clinical validation; 3. perspectives on how to realize the potential of machine learning models and to overcome the perceptual and practical limits of visual scoring
Recommended from our members
Pitfalls in assessing stromal tumor infiltrating lymphocytes (sTILs) in breast cancer
Abstract: Stromal tumor-infiltrating lymphocytes (sTILs) are important prognostic and predictive biomarkers in triple-negative (TNBC) and HER2-positive breast cancer. Incorporating sTILs into clinical practice necessitates reproducible assessment. Previously developed standardized scoring guidelines have been widely embraced by the clinical and research communities. We evaluated sources of variability in sTIL assessment by pathologists in three previous sTIL ring studies. We identify common challenges and evaluate impact of discrepancies on outcome estimates in early TNBC using a newly-developed prognostic tool. Discordant sTIL assessment is driven by heterogeneity in lymphocyte distribution. Additional factors include: technical slide-related issues; scoring outside the tumor boundary; tumors with minimal assessable stroma; including lymphocytes associated with other structures; and including other inflammatory cells. Small variations in sTIL assessment modestly alter risk estimation in early TNBC but have the potential to affect treatment selection if cutpoints are employed. Scoring and averaging multiple areas, as well as use of reference images, improve consistency of sTIL evaluation. Moreover, to assist in avoiding the pitfalls identified in this analysis, we developed an educational resource available at www.tilsinbreastcancer.org/pitfalls
- …